Towards a distributed Arabic OCR based on the DTW algorithm: performance analysis
Authors
Abstract
Despite the diversity of printed Arabic optical character recognition (OCR) products and proposals, the problem is not yet well solved. The complex morphology and calligraphy of Arabic writing on the one hand, and the use of lightweight approaches on the other, explain the weakness of these products. Some stronger approaches have not been commercialised, generally because of their computational complexity. The dynamic time warping (DTW) algorithm is one of these strong approaches. Several studies and experiments have shown that printed Arabic OCR based on the DTW algorithm provides a very good recognition rate, especially for large and very large vocabularies. One attraction of the DTW algorithm is its ability to recognize connected or cursive characters (words or sub-words) properly without prior segmentation. Furthermore, the algorithm performs recognition against a reference library of isolated characters and is highly robust to noise. Unfortunately, the large amount of computation it performs during recognition makes it very slow and hence restricts its use. Many researchers have attempted to speed up the algorithm, but the proposed solutions generally require specific high-cost architectures. Loosely coupled architectures such as clusters or grid computing can provide enough power, without additional cost, to distribute the load of such compute-intensive applications. Consequently, this paper reports an analytical and experimental performance analysis of a distributed Arabic OCR based on the DTW algorithm within loosely coupled architectures.
The results obtained confirm that loosely coupled architectures, and more specifically grid computing, provide a very promising framework for speeding up Arabic OCR based on the DTW algorithm.
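As a rough illustration of the idea summarized in the abstract, the following minimal Python sketch computes a textbook DTW distance and splits the comparison against a reference library across several workers. It is not the authors' implementation: the one-dimensional feature sequences, the function names (`dtw_distance`, `best_in_chunk`, `recognize`), and the use of a local thread pool as a stand-in for cluster or grid nodes are all assumptions made for illustration, and the sketch matches one isolated pattern rather than performing segmentation-free recognition of connected words.

```python
from concurrent.futures import ThreadPoolExecutor


def dtw_distance(a, b):
    """Textbook DTW distance between two 1-D feature sequences (assumed features)."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = cheapest cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # stretch b
                                 cost[i][j - 1],      # stretch a
                                 cost[i - 1][j - 1])  # advance both
    return cost[n][m]


def best_in_chunk(query, chunk):
    """Return (distance, label) of the closest reference in one library chunk."""
    return min((dtw_distance(query, ref), label) for label, ref in chunk)


def recognize(query, library, workers=2):
    """Split the reference library into chunks, score each chunk on its own
    worker, then combine the partial results. The thread pool is only a
    hypothetical stand-in for loosely coupled nodes."""
    items = list(library.items())
    chunks = [items[i::workers] for i in range(workers)]
    chunks = [c for c in chunks if c]  # drop empty chunks
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial = list(pool.map(lambda c: best_in_chunk(query, c), chunks))
    return min(partial)[1]
```

Each library chunk is scored independently, so the only communication is the query going out and one (distance, label) pair coming back per worker. That low coupling is what makes this kind of workload a plausible fit for clusters or grids.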
Similar resources
Dynamic Time Warping Algorithm with Distributed Systems
Distributed computing is the method of splitting a large problem into smaller pieces and allocating the workload among many computers. These individual computers process their portions of the problem, and the results are combined to form a solution for the original problem. At present, distributed computing systems can be broadly classified into two categories, namely grid computing and V...
Arabic Cursive Characters Distributed Recognition using the DTW Algorithm on BOINC
Volunteer computing, or volunteer grid computing, constitutes a very promising infrastructure that provides enough computing and storage power without any prior cost or investment. Indeed, such infrastructures result from the federation of several geographically dispersed computers and/or LAN computers over the Internet. Berkeley Open Infrastructure for Network Computing (BOINC) is consi...
A P2P Grid Architecture for Distributed Arabic OCR Based on the DTW Algorithm
Arabic cursive optical character recognition (OCR) based on the dynamic time warping (DTW) algorithm provides very good segmentation and recognition rates at the same time. However, the computational complexity of the DTW algorithm restricts its widespread use and its consideration at a commercial scale. Accelerating the DTW execution time has attracted many researchers and several sol...
Grid'5000 Based Large Scale OCR Using the DTW Algorithm: Case of the Arabic Cursive Writing
Large scale optical character recognition (OCR) refers to the computerization of large amounts of documents such as newspapers. Despite the diversity of commercial OCR products, this task is still far from mature, especially if the input documents are of insufficient quality or in cursive writing, such as Arabic documents (Vinciarelli, 2002). Indeed, in their project (Holley,...
Performance Evaluation of the distributed Arabic cursive characters recognition using the DTW algorithm on the SRTG
Recognition of printed Arabic cursive characters using the Dynamic Time Warping (DTW) algorithm provides very interesting results. Unfortunately, the large amount of computation performed by this algorithm during recognition makes its execution very slow. Grid computing presents a very interesting infrastructure that makes it possible to support distributed applications on the one hand and to ta...
Towards an Optimal Utilization of Volunteer Grid Computing: a Comparative Study of Three Heuristics
Volunteer grids are attractive infrastructures that drastically reduce the response time of several compute-intensive algorithms and applications, such as Arabic OCR (Optical Character Recognition) based on the Dynamic Time Warping (DTW) algorithm. Intensive experiments performed on such infrastructures confirm their ability to provide enough computing and storage power whic...
Journal title:
- Int. Arab J. Inf. Technol.
Volume 6, Issue
Pages -
Publication date: 2009